Time Related Factors of Data Accuracy, Completeness, and Currency in Multi-Channel Information Systems

Authors

  • Cinzia Cappiello
  • Chiara Francalanci
  • Barbara Pernici
Abstract

Multi-channel information systems involve data redundancy, which in turn requires synchronization mechanisms among overlapping databases. This need for periodic data realignment causes data quality issues related to delays in data updates, which can vary across databases and the corresponding channels and services. Out-of-date data reduce the overall currency of databases and, in turn, decrease their accuracy and completeness. This paper proposes time-related measures of accuracy, completeness and currency and discusses their mutual relationships in multi-channel information systems.

1 Data accuracy, completeness and currency

Critical data quality issues are related to the degree of data integration, which is a consequence of design choices on the overall architecture of a company's information system (IS). This paper focuses on multi-channel information systems in which services are typically provided through a variety of channels, including branches, agents, call centers, Web sites and mobile devices. Customers should be able to access consistent information through all channels and at all times, irrespective of the service that they request. This level of data quality requires a fully integrated multi-channel information system, which, unfortunately, is rarely implemented. More often, systems are composed of different applications with separate, redundant and possibly inconsistent databases.

Data quality problems caused by redundancy and misalignments influence three fundamental data quality dimensions: accuracy, completeness and currency [4]. Different definitions of accuracy are provided by the data quality literature [5], [6]. This paper adopts the definition proposed in [5], where accuracy is associated with data values and is defined as a measure of the proximity of a data value v to some other value v′ that is considered correct. A measure of accuracy can also be associated with data sources.
It can be defined as the ratio between the number of correct values and the total number of values available from a given source [5].

The definition of data completeness is consistent across research contributions. In [5], completeness is associated with data values and is defined as the degree to which a specific database includes all the values corresponding to a complete representation of a given set of real-world events as database entities. According to this definition, it is possible to obtain an objective measure of the completeness of a data source by adding up significant data values.

Currency has no standard definition in the literature. This paper adopts the definition proposed in [1], where currency is defined as the time interval between the time when data are updated and the time when data are used.

The multi-channel architecture with the highest degree of data redundancy and, correspondingly, the lowest degree of integration is shown in Figure 1. Each channel-functionality combination is served by a different software application, and each application has access to a separate operational database od_ij.

Fig. 1. IS architecture with the lowest degree of data integration

Two strategies can be distinguished to improve the degree of integration of the architecture shown in Figure 1:
– Channel integration strategy: each channel has a private database which is shared among multiple functionalities.
– Functional integration strategy: each functionality has a private database that is shared among multiple channels.

In a multi-channel information system that is not fully integrated there is necessarily data redundancy among operational databases. Data sharing across operational databases raises a need for periodic data realignments.
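
The source-level accuracy ratio and the currency interval described above can be illustrated with a minimal Python sketch. It is not the mathematical method of [3]. The function names, the dictionary representation of a data source, and the use of exact equality to decide whether a value is "correct" are all assumptions made for illustration; the definition in [5] speaks more generally of proximity to a correct value.

```python
from datetime import datetime, timedelta


def source_accuracy(values: dict, reference: dict) -> float:
    """Ratio of correct values to all values available from a source.

    Assumption: a value is 'correct' when it equals the reference value
    stored under the same key (exact match, not a proximity measure).
    """
    if not values:
        raise ValueError("the source has no values")
    correct = sum(
        1 for key, value in values.items()
        if key in reference and reference[key] == value
    )
    return correct / len(values)


def currency(updated_at: datetime, used_at: datetime) -> timedelta:
    """Time interval between the last update of the data and their use."""
    if used_at < updated_at:
        raise ValueError("data cannot be used before they are updated")
    return used_at - updated_at


# Hypothetical example data: one of three values is out of date.
source = {"balance": 120, "address": "Via Roma 1", "phone": "555-0100"}
truth = {"balance": 150, "address": "Via Roma 1", "phone": "555-0100"}

accuracy = source_accuracy(source, truth)
age = currency(datetime(2003, 1, 1, 8, 0), datetime(2003, 1, 1, 20, 0))
```

In this sketch a longer interval returned by `currency` means older, less current data, in line with the definition adopted from [1].
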
Each pair of operational databases od_ij and od_mn can be supposed to be aligned with a refresh period rt_ij,mn, which can be seen as the time interval before the data used by the m functionality of the n channel are updated with data created or modified by the i functionality of the j channel, and vice versa. To describe the behavior of customers, operational databases are associated with two parameters:
– The operational frequency of_ij, defined as the average frequency with which users of the i functionality of the j channel change a data unit of operational database od_ij.
– The combined frequency cf_ij,mn, defined as the average frequency with which users of the i functionality of the j channel change a data unit in od_ij ∩ od_mn.

To guarantee a given level of accuracy, completeness and currency, refresh periods should decrease as the combined frequency of data changes increases. Based on these variables, accuracy, completeness and currency can be given mathematical measures consistent with the theoretical definitions provided in [5] (accuracy and completeness) and [1] (currency). A discussion of the mathematical method that has been used to calculate them can be found in [3].

2 Simulation results

Simulations compare the mathematical measures of accuracy, completeness and currency in architectures with different degrees of data integration. The following types of financial institutions are considered:
– Global, national and regional institutions, depending on their size and geographical presence.
– Physical or virtual institutions; the latter operate exclusively through technology-based channels, while the former also operate through physical branches.

Users are classified into three categories depending on the average number of transactions that they complete over time:
– Active users, executing more than 200 transactions per year.
– Moderate users, executing more than 20 but fewer than 200 transactions per year.
– Sleepy users, executing fewer than 20 transactions per year.
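
The user categories above can be sketched as a small Python classifier. The function names are hypothetical. The text leaves the boundary values of exactly 20 and exactly 200 transactions per year unassigned; this sketch assumes they count as moderate.

```python
from collections import Counter


def classify_user(transactions_per_year: float) -> str:
    """Classify a user as 'active', 'moderate' or 'sleepy'.

    Assumption: exactly 20 or exactly 200 transactions per year are
    treated as 'moderate', since the text does not assign them.
    """
    if transactions_per_year < 0:
        raise ValueError("transaction count cannot be negative")
    if transactions_per_year > 200:
        return "active"
    if transactions_per_year < 20:
        return "sleepy"
    return "moderate"


def customer_mix(yearly_counts: list) -> dict:
    """Share of each user category in a customer base."""
    if not yearly_counts:
        raise ValueError("customer base is empty")
    counts = Counter(classify_user(n) for n in yearly_counts)
    total = len(yearly_counts)
    return {
        category: counts[category] / total
        for category in ("active", "moderate", "sleepy")
    }


# Hypothetical customer base with four users.
mix = customer_mix([250, 50, 5, 3])
```
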
Different types of financial institutions have a different mix of active, moderate and sleepy customers, with different patterns of access to services across channels and, consequently, different values of the operational and combined frequencies.

According to simulation results, global and national institutions show a trade-off between accuracy and completeness. Accuracy is maximized by the channel integration strategy, whereas completeness increases with a functional integration strategy. A similar trade-off is also found between currency and completeness. As an explanation, accuracy and currency are significantly reduced if data changes from transactions performed by customers are not updated on all channels in real time; they are less sensitive to transactions that are not performed by a customer but add new customer data, such as an inbound payment that must be credited to a customer's account. Customers will realize that they have received a payment simultaneously from all channels even if the architecture is not fully channel integrated. Conversely, completeness is highly sensitive to the creation of new customer data, even if they are a consequence of inbound transactions. Therefore, from a data quality standpoint, global and national financial institutions should choose a channel integration strategy. It should be noted that, according to the professional literature, both strategies seem to be followed in practice by global and national financial institutions. Note that the trade-off between accuracy and completeness disappears in favour of the functional integration strategy if customers are assumed to be evenly distributed among channels.

Currency, accuracy and completeness are maximized by a functional integration strategy in regional financial institutions. The low frequency of operational transactions in regional institutions is a likely reason for these trends.
From a data quality standpoint, these results confirm the advantages of the functional integration strategy, which is reported to be followed in practice by most regional banks [2].

Virtual financial institutions show a trade-off between currency, accuracy and completeness, similar to global and national institutions. However, the trade-off disappears in favour of the functional integration strategy for high values of refresh time. These results indicate that virtual institutions can implement a channel integration strategy only if their architectures support frequent realignments among channels. Conversely, if the refresh period is longer than 60 hours (3 days), a functional integration strategy should be implemented to guarantee maximum accuracy and completeness. This seems to raise data quality issues for virtual institutions sharing branches with multiple physical institutions.
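
The recommendations reported in this section can be condensed into a small lookup, sketched here in Python. This is only a reading aid, not part of the paper's simulation: the function name and string labels are hypothetical, and returning the channel strategy for virtual institutions at or below the 60-hour threshold is an assumption based on the statement that they behave like global and national institutions when realignments are frequent.

```python
# Threshold reported in the text for virtual institutions.
REFRESH_THRESHOLD_HOURS = 60


def suggested_strategy(institution: str, refresh_period_hours: float) -> str:
    """Return 'channel' or 'functional' for a given institution type.

    Encodes only the data quality recommendations stated in the text;
    the trade-offs between dimensions are not modelled.
    """
    if refresh_period_hours < 0:
        raise ValueError("refresh period cannot be negative")
    if institution in ("global", "national"):
        return "channel"
    if institution == "regional":
        return "functional"
    if institution == "virtual":
        # Assumption: at or below the threshold, realignments are
        # considered frequent enough for channel integration.
        if refresh_period_hours > REFRESH_THRESHOLD_HOURS:
            return "functional"
        return "channel"
    raise ValueError(f"unknown institution type: {institution!r}")
```

For example, `suggested_strategy("virtual", 72)` returns `"functional"`, matching the guidance for refresh periods longer than 60 hours.
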

Similar resources

Validation of Volunteered Geographic Information Landuse Change Using Satellite Imagery

Land use change monitoring is one of the main concerns of managers and urban planners due to human activities and unbalanced physical development in urban areas. In this paper, a combination of remote sensing data and volunteered geographic information was used to assess the quality of volunteered geographic information for monitoring land use and land cover changes. For this purpose, the ORBVIE...

Effects of the Malaria Elimination Program on the Quality of the Malaria Surveillance System in Iran

Background and Aim: The National Malaria Control Program was developed, in 2011, into the National Malaria Surveillance Program. It is one of the most comprehensive surveillance systems in Iran. The aim of this study was to evaluate the impact of the malaria elimination program on data quality and accuracy in the national malaria surveillance system. Materials and Methods: This was a cross...

Assessment of the completeness of Volunteered Geographic Information focusing on building blocks data (Case Study: Tehran metropolis)

Open Street Map (OSM) is currently the largest collection of volunteered geographic data, widely used in many projects as an alternative to or integrated with authoritative data. However, the quality of these data has been one of the obstacles to their wide use. In this article, from among the elements related to the quality of volunteered geographic data, we have tried to examine the com...

Improving Accuracy of Recommender Systems using Social Network Information and Longitudinal Data

The rapid development of technology, the Internet, and the development of electronic commerce have led to the emergence of recommender systems. These systems will assist the users in finding and selecting their desired items. The accuracy of the advice in recommender systems is one of the main challenges of these systems. Regarding the fuzzy systems capabilities in determining the borders of us...

Time-Varying Frequency Fading Channel Tracking In OFDM-PLNC System, Using Kalman Filter

Physical-layer network coding (PLNC) has the ability to drastically improve the throughput of multi-source wireless communication systems. In this paper, we focus on the problem of channel tracking in a Decode-and-Forward (DF) OFDM PLNC system. We proposed a Kalman Filter-based algorithm for tracking the frequency/time fading channel in this system. Tracking of the channel is performed in the t...

Publication date: 2003